36 research outputs found
The p53HMM algorithm: using profile hidden markov models to detect p53-responsive genes
<p>Abstract</p> <p>Background</p> <p>A computational method (called p53HMM) is presented that utilizes Profile Hidden Markov Models (PHMMs) to estimate the relative binding affinities of putative p53 response elements (REs), both p53 single-sites and cluster-sites. These models incorporate a novel "Corresponded Baum-Welch" training algorithm that provides increased predictive power by exploiting the redundancy of information found in the repeated, palindromic p53-binding motif. The predictive accuracy of these new models are compared against other predictive models, including position specific score matrices (PSSMs, or weight matrices). We also present a new dynamic acceptance threshold, dependent upon a putative binding site's distance from the Transcription Start Site (TSS) and its estimated binding affinity. This new criteria for classifying putative p53-binding sites increases predictive accuracy by reducing the false positive rate.</p> <p>Results</p> <p>Training a Profile Hidden Markov Model with corresponding positions matching a combined-palindromic p53-binding motif creates the best p53-RE predictive model. The p53HMM algorithm is available on-line: <url>http://tools.csb.ias.edu</url></p> <p>Conclusion</p> <p>Using Profile Hidden Markov Models with training methods that exploit the redundant information of the homotetramer p53 binding site provides better predictive models than weight matrices (PSSMs). These methods may also boost performance when applied to other transcription factor binding sites.</p
The PLIN4 Variant rs8887 Modulates Obesity Related Phenotypes in Humans through Creation of a Novel miR-522 Seed Site
PLIN4 is a member of the PAT family of lipid storage droplet
(LSD) proteins. Associations between seven single nucleotide polymorphisms
(SNPs) at human PLIN4 with obesity related phenotypes were
investigated using meta-analysis followed by a determination if these phenotypes
are modulated by interactions between PLIN4 SNPs and dietary
PUFA. Samples consisted of subjects from two populations of European ancestry.
We demonstrated association of rs8887 with anthropometrics. Meta-analysis
demonstrated significant interactions between the rs8887 minor allele with PUFA
n3 modulating anthropometrics. rs884164 showed interaction with both n3 and n6
PUFA modulating anthropometric and lipid phenotypes. In silico
analysis of the PLIN4 3′UTR sequence surrounding the
rs8887 minor A allele predicted a seed site for the human microRNA-522
(miR-522), suggesting a functional mechanism. Our data showed that a PLIN4
3′UTR luciferase reporter carrying the A allele of rs8887 was reduced in
response to miR-522 mimics compared to the G allele. These results suggest
variation at the PLIN4 locus, and its interaction with PUFA as
a modulator of obesity related phenotypes, acts in part through creation of a
miR-522 regulatory site
Xenobiotic metabolizing enzyme gene polymorphisms predict response to lung volume reduction surgery
<p>Abstract</p> <p>Background</p> <p>In the National Emphysema Treatment Trial (NETT), marked variability in response to lung volume reduction surgery (LVRS) was observed. We sought to identify genetic differences which may explain some of this variability.</p> <p>Methods</p> <p>In 203 subjects from the NETT Genetics Ancillary Study, four outcome measures were used to define response to LVRS at six months: modified BODE index, post-bronchodilator FEV<sub>1</sub>, maximum work achieved on a cardiopulmonary exercise test, and University of California, San Diego shortness of breath questionnaire. Sixty-four single nucleotide polymorphisms (SNPs) were genotyped in five genes previously shown to be associated with chronic obstructive pulmonary disease susceptibility, exercise capacity, or emphysema distribution.</p> <p>Results</p> <p>A SNP upstream from glutathione S-transferase pi (<it>GSTP1</it>; p = 0.003) and a coding SNP in microsomal epoxide hydrolase (<it>EPHX1</it>; p = 0.02) were each associated with change in BODE score. These effects appeared to be strongest in patients in the non-upper lobe predominant, low exercise subgroup. A promoter SNP in <it>EPHX1 </it>was associated with change in BODE score (p = 0.008), with the strongest effects in patients with upper lobe predominant emphysema and low exercise capacity. One additional SNP in <it>GSTP1 </it>and three additional SNPs in <it>EPHX1 </it>were associated (p < 0.05) with additional LVRS outcomes. None of these SNP effects were seen in 166 patients randomized to medical therapy.</p> <p>Conclusion</p> <p>Genetic variants in <it>GSTP1 </it>and <it>EPHX1</it>, two genes encoding xenobiotic metabolizing enzymes, were predictive of response to LVRS. These polymorphisms may identify patients most likely to benefit from LVRS.</p
Cataloguing functionally relevant polymorphisms in gene DNA ligase I: a computational approach
A computational approach for identifying functionally relevant SNPs in gene LIG1 has been proposed. LIG1 is a crucial gene which is involved in excision repair pathways and mutations in this gene may lead to increase sensitivity towards DNA damaging agents. A total of 792 SNPs were reported to be associated with gene LIG1 in dbSNP. Different web server namely SIFT, PolyPhen, CUPSAT, FASTSNP, MAPPER and dbSMR were used to identify potentially functional SNPs in gene LIG1. SIFT, PolyPhen and CUPSAT servers predicted eleven nsSNPs to be intolerant, thirteen nsSNP to be damaging and two nsSNPs have the potential to destabilize protein structure. The nsSNP rs11666150 was predicted to be damaging by all three servers and its mutant structure showed significant increase in overall energy. FASTSNP predicted twenty SNPs to be present in splicing modifier binding sites while rSNP module from MAPPER server predicted nine SNPs to influence the binding of transcription factors. The results from the study may provide vital clues in establishing affect of polymorphism on phenotype and in elucidating drug response
Probabilistic Inference of Transcription Factor Binding from Multiple Data Sources
An important problem in molecular biology is to build a complete understanding of transcriptional regulatory processes in the cell. We have developed a flexible, probabilistic framework to predict TF binding from multiple data sources that differs from the standard hypothesis testing (scanning) methods in several ways. Our probabilistic modeling framework estimates the probability of binding and, thus, naturally reflects our degree of belief in binding. Probabilistic modeling also allows for easy and systematic integration of our binding predictions into other probabilistic modeling methods, such as expression-based gene network inference. The method answers the question of whether the whole analyzed promoter has a binding site, but can also be extended to estimate the binding probability at each nucleotide position. Further, we introduce an extension to model combinatorial regulation by several TFs. Most importantly, the proposed methods can make principled probabilistic inference from multiple evidence sources, such as, multiple statistical models (motifs) of the TFs, evolutionary conservation, regulatory potential, CpG islands, nucleosome positioning, DNase hypersensitive sites, ChIP-chip binding segments and other (prior) sequence-based biological knowledge. We developed both a likelihood and a Bayesian method, where the latter is implemented with a Markov chain Monte Carlo algorithm. Results on a carefully constructed test set from the mouse genome demonstrate that principled data fusion can significantly improve the performance of TF binding prediction methods. We also applied the probabilistic modeling framework to all promoters in the mouse genome and the results indicate a sparse connectivity between transcriptional regulators and their target promoters. To facilitate analysis of other sequences and additional data, we have developed an on-line web tool, ProbTF, which implements our probabilistic TF binding prediction method using multiple data sources. Test data set, a web tool, source codes and supplementary data are available at: http://www.probtf.org
Inflammatory Gene Regulatory Networks in Amnion Cells Following Cytokine Stimulation: Translational Systems Approach to Modeling Human Parturition
A majority of the studies examining the molecular regulation of human labor have
been conducted using single gene approaches. While the technology to produce
multi-dimensional datasets is readily available, the means for facile analysis
of such data are limited. The objective of this study was to develop a systems
approach to infer regulatory mechanisms governing global gene expression in
cytokine-challenged cells in vitro, and to apply these methods
to predict gene regulatory networks (GRNs) in intrauterine tissues during term
parturition. To this end, microarray analysis was applied to human amnion
mesenchymal cells (AMCs) stimulated with interleukin-1β, and differentially
expressed transcripts were subjected to hierarchical clustering, temporal
expression profiling, and motif enrichment analysis, from which a GRN was
constructed. These methods were then applied to fetal membrane specimens
collected in the absence or presence of spontaneous term labor. Analysis of
cytokine-responsive genes in AMCs revealed a sterile immune response signature,
with promoters enriched in response elements for several inflammation-associated
transcription factors. In comparison to the fetal membrane dataset, there were
34 genes commonly upregulated, many of which were part of an acute inflammation
gene expression signature. Binding motifs for nuclear factor-κB were
prominent in the gene interaction and regulatory networks for both datasets;
however, we found little evidence to support the utilization of
pathogen-associated molecular pattern (PAMP) signaling. The tissue specimens
were also enriched for transcripts governed by hypoxia-inducible factor. The
approach presented here provides an uncomplicated means to infer global
relationships among gene clusters involved in cellular responses to
labor-associated signals
Cancer Biomarker Discovery: The Entropic Hallmark
Background: It is a commonly accepted belief that cancer cells modify their transcriptional state during the progression of the disease. We propose that the progression of cancer cells towards malignant phenotypes can be efficiently tracked using high-throughput technologies that follow the gradual changes observed in the gene expression profiles by employing Shannon's mathematical theory of communication. Methods based on Information Theory can then quantify the divergence of cancer cells' transcriptional profiles from those of normally appearing cells of the originating tissues. The relevance of the proposed methods can be evaluated using microarray datasets available in the public domain but the method is in principle applicable to other high-throughput methods. Methodology/Principal Findings: Using melanoma and prostate cancer datasets we illustrate how it is possible to employ Shannon Entropy and the Jensen-Shannon divergence to trace the transcriptional changes progression of the disease. We establish how the variations of these two measures correlate with established biomarkers of cancer progression. The Information Theory measures allow us to identify novel biomarkers for both progressive and relatively more sudden transcriptional changes leading to malignant phenotypes. At the same time, the methodology was able to validate a large number of genes and processes that seem to be implicated in the progression of melanoma and prostate cancer. Conclusions/Significance: We thus present a quantitative guiding rule, a new unifying hallmark of cancer: the cancer cell's transcriptome changes lead to measurable observed transitions of Normalized Shannon Entropy values (as measured by high-throughput technologies). At the same time, tumor cells increment their divergence from the normal tissue profile increasing their disorder via creation of states that we might not directly measure. This unifying hallmark allows, via the the Jensen-Shannon divergence, to identify the arrow of time of the processes from the gene expression profiles, and helps to map the phenotypical and molecular hallmarks of specific cancer subtypes. The deep mathematical basis of the approach allows us to suggest that this principle is, hopefully, of general applicability for other diseases